
• 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



\ (i) APPLICANT: Ponath, Paul D. 

11 Ringler, Douglas J. 

Jones, S. Tarran 
Newman, Walter 
Saldanha, Jose 
Bendig, Mary M. 



(ii) TITLE OF INVENTION: 



HUMANIZED IMMUNOGLOBULIN REACTIVE WITH a4f7 
INTEGRIN 



(iii) NUMBER v OF SEQUENCES: 63 



(iv) CORRESPO: 

(A) ADD 

(B) STREE 

(C) CITY 

(D) STATi 

(E) COUNT 

(F) ZIP: 




M H fiS„, Brook, Smith & Reynolds, P.C. 

to Militia Drive 
.ngton 
sachusetts 

. Ufi A 
173 \ 

(v) COMPUTER READABLE FORM: 
11 (A) MEDIUM TYPE: fW>Y dxsk 

B COMPUTER: IBM DOS 
JSl ESS SS^Sff-W. Ver S ion #1 -30 

(vi) «™^S« , ! h700 f 737 

B FILING DATE: 15-AUG-1996 v 
(C) CLASSIFICATION: 

(viii) ATTORNEY / AGENT INFORMATION: v 
( (A) NAME: Brook, David E. \ 

B REGISTRATION NUMBER: 22,592 \ 
\V) REFERENCE/DOCKET NUMBER: LKSSS^IO 

(ix) TELECOMMUNICATION INFORMATION: \ 
( ' (A) TELEPHONE: (617) 861-6240 
(B) TELEFAX : (617) 861-9540 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 494 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

1 ' (A) NAME/KEY: CDS 

(B) LOCATION: 13.. 444 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



TGGTATCAAC 


AGCT ACAAv* x 


60 


TTGTGAAGCC 


TGGGACTTCA 


120 


GCTACTGGAT 


GCACTGGGTG 


180 


TTGATCCTTC 


TGAGAGTAAT 


240 


CTGTAGACAT 


TTCCTCCAGC 


300 


CTGCGGTCTA 


CTATTGTGCA 


360 


GGGGTCAAGG 


CACCTCAGTC 


420 


; YYSBDNNCCC 


YKGRWSCYTG 


480 






494 



GNNGAAGCTT GGGA 

(2 ) INFORMATION FOR SEQ ID NO: 2: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 144 anu.no acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO:2: 

J xa. *a. Xaa Xaa xaa x,e xaa H. L~ v., ~ - ~ - 



nn Pro Glv Ala Glu Leu Val Lys 
Val His Ser Gin Val Gin Leu Gin Gin Pro Gly A ^ 

20 " 
Pro aly Thr Ser Val Lys Leu Ser Cys Lys Gly Tyr Gly Tyr Thr Phe 

Th r ser Tyr Trp Met His Trp Val Lys Cln Aro Pro Oly - Oly Leu 

Glu "p He Oly Olu lie Asp Pro Ser Olu Ser Asn Thr Asn Tyr Asn 

l Lys Phe Lys Gly Lys Ala Thr Leu Thr Val Asp He Ser Ser Se, 

Met Tin Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val 



Thr Ala Tyr Met i«u — 

100 

^ ^ cys Ma « r «, Ty, M, 01, ««P -P ffi '» 



115 

^ Ttp „, 01 „ «, », J- «! - va, se, se, Ma »r 

130 1Ja 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 428 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 18.. 428 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
T „CO .CTCCCO » OOJ TOO « « «» « * « ™ ™ 



l 5 



2 s s s s a a a a a a = a i s s 

15 ' ■ 20 

s a s c a a a a a a a = = - K s 

30 35 

- S B B B B B 5 S S i = - = = 

45 50 

S S SS S B S S S B B 5 S 5 B S 1 

60 65 
AAT ACT AAC TAC AAT CAA AAA TTC AAG GGC AAG GCC ACA TTG ACT GTA 
Asn Thr Asn Tyr Asn Gin Lys Phe Lys Gly Lys Ala Tnr i. ^ 
80 Bb 

S S B S B S S B = = = = = £ * = 

b b s s s a a i a a a a i a aa 

110 115 

Bsasaaaaassass 



98 



146 



194 



242 



290 



338 



386 



428 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 



(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4s 

« «, « P ~ - ™ To val ser ' hr "* T " ~ 

Val Hi. S„ «. Val .1. ». «. «• »» "V ° l " IS 

20 zo 

m .1, « ~ W - ~ ^ * G « ^ ^ 

ser t" Trp M.t Hi. »l * «- « «* 01 ° «* 

Glu C Ue «, «. .i. «- - «- « Thr ~ v "S 

"„ P.. L„ «j W - T„r T §S vai MP i« « ■« « 
Tht ^ Iyt Met oi„ ~ - ~ - thr Ser «. »P ffi - " l 

Iyr ^ c, T. «, « y «» Sg »» ol * " p Mp 31 " a " e *" 

115 

Ty r Trp Gly Gin Gly Thr Ser Val Thr 

130 1Ja 
(2 ) INFORMATION FOR SEQ ID NO: 5: 

fi\ SEQUENCE CHARACTERISTICS: 
1 ' (A) LENGTH: 535 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

K (A) NAME/KEY: CDS 

(B) LOCATION: 16.. 435 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CGATTACTAG TCGAC ATG AAG TTG CCT GTT AGG CTG TTG GTG CTT CTG TTG 

140 145 



ss s s s s = s s s s i a s a s i 

150 155 

_ m rTT TCT ATC TCT TGC AGG 

s 2 s - i - - " " i ?n s set s - 
s £ ss s s s £ s i ss £ s - i - " 



51 



99 



195 
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435 

495 
535 



TAC CTC « U. COT CGC CAG g « CAG £ CJC «C TAT - « 

Tyr Leu His Lys Pro Gly Gin ser m 21Q 
200 2U3 

^ ar-r TTC AGT GGC AGT GGT TCA 

S E S S S S fj SS £ 5 A ~ - - - ~ 

s s s s s = - s = s i - 5 ™ " p a 

230 235 

GGA ATG TAT TAC TGC TTA CAA CGT ACA CAT CAG CCO TAC ACG TTC CCA 
Gly Met Tyr Tyr cy» Leu oln Gly Thr bi^ 260 

GGG GGG ACC AAG Z ATA AAA CGC GCT CAT CCT CCA CCA ACT GTA 

Gly Gly Thr Lys Leu Olu He Lys Arg »ia * 2?5 

TC CATCTTCC C^ATCCAC TAAGCTTGGG AATCCATATO ACTAGTACAT CCTCTAGAGT 
CGACCTGCAG GCATGCAAGC TTCCCTATAG TGAGTCGTAT 

(2) INFORMATION FOR SEQ ID NO: 6: 

<ii SEQUENCE CHARACTERISTICS: 

1 (A) LENGTH: 140 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
Met Lys Leu Pro Val Arg Leu Leu Val Leu Leu Leu Phe Trp lie Pro 

Val Ser Gly Gly ^ Val Val Val thr Gin Thr Pro Leu Ser Leu Pro 

20 " 
val s.r Phe Cly Aep .» v.l « «■ ■« <*° «• II 
Leu Ala J. ser Tyr cly « Thr Tyr Leu ser Trp Tyr I- HI. Ly. 
Pro cly Cl„ ser Pro cl„ Leu Leu He Tyr cly U. Ser A.h Ar g Phe 
W Cly val Pro Asp Ar, Phe S.r cly ser cly ser cly Thr Aep Phe 

Thr Leu Lye He si Thr He Lys Pro Clu A.p Leu cly Met Tyr Tyr 

100 lu:> 
eye Leu Gin cly Thr HI. cln Pro Tyr Thr Ph. Cly cly cly Thr Lye 

115 ^ U 
L eu Glu lie Lys Arg Ala Asp Ala Ala Pro Thr Val 
130 1J3 



243 
291 
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INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

A8 p Val Val Val Thr Gin Thr Pro Leu Ser Leu Pro Val Ser Phe Gly 

Lp Gin val Ser He Ser Cys Arg Ser Ser Gin Ser Leu Ala Lys Ser 

20 25 
Tyr Gly Asn Thr Tyr Leu Ser Trp Tyr Leu His Lys Pro Gly Gin Ser 

Pro Gin Leu Leu lie Tyr Gly He Ser Asn Arg Phe Ser Gly Val Pro 

50 55 
Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Lys lie 

Ter Thr lie Lys Pro Glu Asp Leu Gly Met Tyr Tyr Cys Leu Gin Gly 

85 9U 
Thr His Gin Pro Tyr Thr Phe Gly Gly Gly Thr Lys Leu Glu lie Lys 

100 105 



2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

A8P He val Met Thr Gin Ser Pro Leu Ser Leu Pro Val Thr Pro Gly 
Glu Pro Ala Ser lie Ser Cys Arg Ser Ser Gin Ser Leu Leu His Ser 



20 25 



Asn Gly Tyr Asn Tyr Leu Asp Trp Tyr Leu Gin Lys Pro Gly Gin Ser 
Pro Gin Leu Leu lie Tyr Leu Gly Ser Asn Arg Ala Ser Gly Val Pro 



50 
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MP n. ~ olv - olv ~ «* - V - - - - S- 

L «. v., «. g. «. - - - y ~ * c,< Het » " a 

MU Olh Thr « «- - «, 01. 01, ~ ^ V- «S '» - 

) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xU MOO OBSCKXPTXOH.. SE 0 » 
01„ v,l .1. U. Oln ol„ Pro 01, M» olu - v.! 



L »x - ~ - r 015 Thr phe » ser *" 

20 

Trp „« ... « ~ «. «■ »? - oi ° oly sr 01u Itp Ile 

01y 01 U l MP « ~ «. ~ - ~ » "* 

w . II ^ - - - ~ - - - ?r s " s " Thr " a 3 1 

1 ol„ M u « ~ - Thr « .1. «P ~ »* - - If - 

Ui «, 0* ^ »p .»» - a * sla Ile MP »° ICP ° ly 

01» oly Thr ser val Thr v.l sjr s.c 

115 

(2 ) INFORMATION FOR SEQ ID NO: 10: 
M, SEQUENCE CHARACTERISTICS: 
( (A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



-125- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Gin Val Gin Leu 



Val Gin Ser Gly Ala Olu Val Lys Lys Pro Gly Ala 

10 A3 



ser v.l Ly. v.1 ser cys Lye Ala ser Gl, Tyr Thr Phe Thr ser Tyr 

20 25 
Ala Met Hie Trp Val Ar, 01" «• "° Ar, Leu olu Trp Hat 

01, Trp lie A.„ Ala 01, M. 01, A.n Thr Ly. Tyr ser oln X.,. Phe 

oin «» Ar, val Thr lie Thr Arg Aep Thr ser Ala ser Thr Ala Tyr 

L Olu Leu ser ser Leu Ar, ser olu A,p Thr Ala val T,r Tyr c,= 

85 9U 
»1. AT, Oly 01, Tyr Tyr Oly Ser Oly Ser Aen T,r Trp 01, oln 01, 



100 105 

Thr Leu Val Thr Val Ser Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 11: 

m SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 396 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..396 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

S 5 = S i 5 = = S g = = = - 1 5 



S = S 5 5 S S S S oS S S = = = S 

160 165 
GTC AGO TTT GGA GAT CAA GTT TCT ATC TCT TGC AGG TCT AGT CAG AGT 
Val Ser Phe Gly Asp Gin Val Ser He Ser Cys Arg se & 
175 180 

~r.r. »»n irr TAT TTG TCT TGG TAC CTG CAC AAG 

£ S £e K Tyr °o5, E S Tyr Leu ser Trp Tyr Leu „L Lye 

190 195 
CCT OOC CAO TCT CCA CAO « CTC .ATC TAT COO ATT TCC AAC AOA TTT 

Pro Gly Gin Ser Pro Gin Leu Leu He Tyr Gly ix 22Q 



48 



96 



144 



192 



240 
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= s a s i s = s s i " s - s a s 
s = s s s s s 5 i s ss = s s s ss 

240 245 

^ m „ r TAC ACG TTC GGA GGG GGG ACC AAG 

5! S S £ E SS « ~ S «. «, «, «, w . 

255 260 



CTG GAA ATA AAA 
Leu Glu He Lys 
270 



(2) INFORMATION FOR SEQ ID NO: 12: 

(L) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Lys Leu Pro Val Arg Leu Leu Val Leu Leu Leu Phe Trp II. Pro 

Val Ser Gly Gly Asp Val Val Val Thr Gin Thr Pro Leu Ser Leu Pro 

20 25 
val ser Ph. oly MP 01" V.1 ser He ser cy. Arg ser ser 01. ser 

L.u »1. Z s.r Tyr oly « Thr Tyr Leu s.r Trp Tyr Leu Hi. Ly. 

50 55 
Pro Gly Gin Ser Pro Gin Leu Leu lie Tyr Gly He Ser Asn Arc, Phe 

Ser Gly Val Pro Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe 

85 9U 
Thr Leu Ly. lie s.r Thr lie Ly. Pro olu h. P Leu oly Met Tyr Tyr 

100 105 
cy. Leu Oln Oly Thr Hi. Oln pro Tyr Thr Phe Oly Oly oly Thr Ly. 



115 

Leu Glu He Lys 
130 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 336 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



288 
336 
384 
396 
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,xi) SEQUENCE DESCRIPTION : SEQ ID NO: 13: 
OATATTGTGA TGACTCAGTC TCCACTCTCC CTOCCCCTCA CCCCTGGAGA CCCOGCCTCC 
ATCTCCTGCA GGTCTAGTCA GACCCTCCTC CATAGTAATG GATCAAACTA TTTGGATTGG 
T ACCTGCAGA AGCCAGGGCA GTCTCCACAG CTCCTGATCT ATTTGGGTTC TAATCCCGCC 
TCCGGGGTCC CTGACAGGTT CAGTGGCACT GGATCAGGCA CAGATTTTAC ACTGAAAATC 
AG CAGAGTGG AGGCTGAGGA TGTTGGGGTT TATTACTGCA TGCAAGCTCT ACCAACTCCT 

CAGACGTTCG GCCAAGGGAC CAAGGTGGAA ATCAAA 

(2) INFORMATION FOR SEQ ID NO: 14: 

<i\ SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 base parrs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(ix) FEATURE: 

* (A) NAME /KEY: CDS 

(B) LOCATION: 1.-420 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
ATG GGA TGG AGC TGT ATC ATC CTC TTC TTO GTA TCA ACA GCT ACA JH 
Met Gly Trp Ser Cys He lie Leu Pne l. M5 
135 14U 



s s s s s a = s ss s s s s s s s 

150 155 

_ rrT T rt GGC TAC ACC TTC 

£ SS £ E - S S S S E i ~ * ~ E! 

165 170 

acc ago tac tgg atg cac tgg gtg £ « jj» « •» Jg S 

Thr s«r Tyr Trp M«t His Trp val Lys Gin «rg 195 

GAG TGO ATC GGA Z ATT •» OCT TCT GAG ACT AAT ACT AAC TAC AAT 

Glu Trp He Gly Olu He Asp Pro Ser eiu 21Q 

200 ZU3 
CAA AAA TTC AAG GGC AAG CCC ACA TTG ACT GTA GAC ATT TCC TCC AGC 
Gin Lys Phe Lys Gly Lys Ala Thr Leu Tnr v ^ 

ACA GCC TAb ATG CAG CTC AGC AGC CTC ACA TCT GAG GAC TCT GCG GTC 

Thr Ala Tyr Met Gin Leu Ser Ser Leu ^ 
230 2Jb 



60 
120 
180 
240 
300 

336 



48 



96 



144 



240 



288 



336 
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TAC TAT TGT GCA AGA GGG GGT TAC GAC GGA TGG GAC TAT GCT ATT GAC 
gS Sr Sb aS Arg Gly Gly Tyr Asp Gly Trp Asp Tyr Ala lie Asp 
245 250 " b 

TAC TGG GGT CAA GGC ACC TCA GTC ACC GTC TCC TCA 
Tyr Trp Gly Gin Gly Thr Ser Val Thr Val Ser Ser 
y 265 270 



384 



420 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Gly Trp Ser Cys lie lie Leu Phe Leu Val Ser Thr Ala Thr Ser 
1 5 10 

Val His Ser Gin Val Gin Leu Gin Gin Pro Gly Ala Glu Leu Val Lys 
20 25 30 

Pro Gly Thr Ser Val Lys Leu Ser Cys Lys Gly Tyr Gly Tyr Thr Phe 

* Af\ 45 



35 



Thr Ser Tyr Trp Met His Trp Val Lys Gin Arg Pro Gly Gin Gly Leu 

55 60 



50 



Glu Trp lie Gly Glu lie Asp Pro Ser Glu Ser Asn Thr Asn Tyr Asn 

65 70 75 

Gin Lys Phe Lys Gly Lys Ala Thr Leu Thr Val Asp lie Ser Ser Ser 



Thr Ala Tyr Met Gin Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val 



100 



Tyr Tyr Cys Ala Arg Gly Gly Tyr Asp Gly Trp Asp Tyr Ala lie Asp 
US 120 125 

Tyr Trp Gly Gin Gly Thr Ser Val Thr Val Ser Ser 
130 135 1 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 414 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..414 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



Trr rTT TTT CTT GTG GCT ATT TTA AAA GGT 

S SX S"S S S S £ 2. «i »a n. u. ig oi, 

145 150 

s a si s 52 a s s a = s s s s 5 £ 

160 165 

s S S = C 5 s I 5 " s = i - s s 

s s i s a s.s is 5 s s i - - - = 

190 195 
GAG TGG ATG GGA TGG ATC AAC GCT GGC AAT GGT AAC ACA AAA TAT TCA 
Glu Trp Met Gly Trp He Asn Ala Gly Asn Gly Asn Thr *.y y ^ 
205 210 

„^ rTr ACC ATT ACC AGG GAC ACA TCC GCG AGC 

2° n ^ S S5 S? S " £ £ t S5 «, »p m g. « 

225 230 
ACA GCC TAC ATG GAG CTG AGC AGC CTG AGA TOT GAA GAC ACG GCT GTG 
Thr Ala Tyr Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Tnr 
240 245 

« ^ *™ rm OCT TAC TAT GGT TCG GGG AGC AAC TAC TGG 
TAT TAC TGT GCG AGA GGA GGT TAC TAT GGT ^ ^ ^ ^ 

Tyr Tyr Cys Ala Arg Gly Gly Tyr lyr y 265 

255 2bU 

GGC CAG GGA ACC CTG GTC ACC GTC TCC TCA 
Gly Gin Gly Thr Leu Val Thr Val Ser Ser 
270 275 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Glu Phe Gly Leu Ser Trp Leu Phe Leu Val Ala He Leu Lys Gly 

1 5 10 

Val Gin Cys Gin Val Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lys 

20 25 
Pro Gly Ala Ser Val Lys Val Ser Cys Lys Ala Ser Gly Tyr Thr Phe 

35 40 
Thr Ser Tyr Ala Met His Trp Val Arg Gin Ala Pro Gly Gin Arg Leu 
50 55 



48 



96 



144 



192 



240 



288 



336 



384 



414 
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Gla Trp Met «, «p XI. « »U o ly M. 01, « ~ ^ ~ 

"» » «- «v ~ «- 118 T 55 Mp Tht s " "t s " 

IM >U T,, ~ « ~ « «. ~ I- -» S5 "* 

100 

v T,r C. AU U, .1, ~T 55 * S « S 

115 - LZU 

Gly Gin Gly Thr Leu Val Thr Val Ser Ser 

130 ^ 
(2) INFORMATION FOR SEQ ID NO: 18: 

,1, SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

* (A) NAME/KEY: CDS 

(B) LOCATION: 1..540 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

s Si a a a a s s a a a a a = « = 

140 145 



s s = s a si s a a = I = a " - i 

155 160 

S 5 S = S S S = " i 5 S= S S S a 

175 

s a a a a s a a g a s a a a a = 

190 X " 

s s s a a s 5 a = a a a s - " - 

205 210 

~ « otc « ™ £ « -c s « s s 

Gin Lys Phe Lys Gly Arg Val Tnr um 23Q 
220 ^ 

s s a a a a a a a a I « a a a a 

235 240 



48 



96 
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3 S 5 S i 5 S 5S 5 i S = " S S S 

5! 5 a a a a a a a a a a a a a x° 

1 270 27b 

ss a a a ss a a a a a a a a a a a 

285 290 

s s a a a a a a a a a i a a =x a 



300 

GTG ACG GTG TCG 
Val Thr Val Ser 
315 

(2) INFORMATION FOR SEQ ID NO: 19: 

UV SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

„et LyB cys Thr Trp val lie Leu ™ «« •« »" "» *5 ^ 

1 5 



Val His Ser Oln Val Gin Leu Val Gin Ser 01, Ala Glu Val Lys Lys 

20 25 
Pro o!y Ala Ser val Lys Val ser Cy. Lye «» «J T "' " he 

Thr ser C Trp Met Hie Trp val Aro oln Ala Pro oly oln Ar, Leu 

50 55 
Olu Trp He Oly olu ,1. A.p Pro ser olu ser Asn Thr Asn Tyr Asn 

Oil Lys P he Lys Oly Ar, Val Thr Leu Thr Val Asp He Ser Ala ser 

85 

Thr Ala Tyr Met olu Leu ser ser Leu Are ser Olu Asp Thr Ala val 

100 10b 
Tyr Tyr eg Ala Ar, oly 01, Tyr Aep Gly Trp Asp Tyr Ala He Asp 

T,r Trp Z Oln oly Thr Leu val Thr val ser ser Ala ser Thr Ly. 

130 135 
Oly Pro Ser Val Phe Pro Leu Ala Pro 8 r Ser Lys Ser Thr Ser Gly 



384 



432 



480 



528 



540 
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«, Thr Ala A!, lj. 01, C «-« ~ S5 "» " he Pr ° !S "° 



Val Thr Val Ser 
180 



(2) INFORMATION FOR SEQ ID NO: 20: 

(iV SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 413 base pairs 

(B) TYPE: nucleic acid 

( c ) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..413 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

S S a S 511 a = 3 a i = = - 5 S S 

185 

a = a a a s a a a a a a = i a s 

200 205 

a s a as a a a a s = a 5 1 = a a 
s a a a a s a a.s a a i a a a a 

230 235 
OCT GGC CAG TCT CCA C f CTC CTC ATC TAT 000 ATT TCC AAC AGA TTT 
Pro Gly Gin Ser Pro Gin Leu Leu lie Tyr fciy * 26Q 
245 250 

TCT OOO OTO CCA OAC AGO TTC ACT GGC ACT GOT TCA GOO ACA OAT TTC 

Ser Gly Val Pro Asp Arg Phe Ser Gly ser uiy o 2?5 

265 

s a a s a a a a a a a a a a a a 

280 2Sb 

a a a a a a a a a a a a a a a a 

295 300 . 

GTG GAA ATA AAA COG OCT GAT GCG GCG CC 
Val Glu lie Lys Arg Ala Asp Ala Ala Pro 
310 315 



48 
96 

144 

192 

240 

288 

336 

384 

413 



(2) INFORMATION FOR SEQ ID NO: 2 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2U 



Met Lys Leu Pro Val Arg Leu Leu Val Leu Leu Leu Phe Trp lie Pro 

1 5 
Val Ser Gly Gly Asp Val Val Met Thr Gin Ser Pro Leu Ser Leu Pro 

20 25 
Val Thr Pro Gly Glu Pro Ala Ser lie Ser Cys Arg Ser Ser Gin Ser 

35 40 
Leu Ala Lys Ser Tyr Gly Asn Thr Tyr Leu Ser Trp Tyr Leu Gin Lys 

50 55 
Pro Gly Gin Ser Pro Gin Leu Leu lie Tyr Gly He Ser Asn Arg Phe 



65 



70 



Ser Gly Val Pro Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe 

85 yu 

Thr Leu Lys lie Ser Arg Val Glu Ala Glu Asp Val Gly Val Tyr Tyr 

100 iUS 

Cys Leu Gin Gly Thr His Gin Pro Tyr Thr Phe Gly Gin Gly Thr Lys 

115 120 iZ3 

Val Glu He Lys Arg Ala Asp Ala Ala Pro 
130 "5 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
TTTCCGGAGG TGATGTTGTG ATGACTCAAA GTCCACTCTC CCTGCCTGTC ACCCCTGGAG 60 

94 

AACCAGCTTC TATCTCTTGC AGGTCTAGTC AGAG 
(2) INFORMATION FOR SEQ ID NO: 23: 

, (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
ACTGGCCAGG CTTCTGCAGG TACCAAGACA AATAGGTGTT CCCATAACTC TTTGCAAGAC 
TCTGACTAGA CCTGCAAGAG ATAGAAGCTG GTTC 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CCTGGCCAGT CTCCACAGCT CCTCATCTAT GGGATTTCCA ACAGATTTTC TGGGGTGCCA 
GACAGGTTCA GTGGCAGTGG TTC 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
ACTCGCGAGA TCTTGAGTGT GAAATCTGTC CCTGAACCAC TGCCACTGAA CCTGTCTGGC 
ACCCCAGAAA ATCTGTTGGA AATC 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
TCTCGCGAGT AGAGGCTGAG GACGTGGGAG TGTATTACTG CTTACAAGGT ACACATCAGC 



CGTACAC 
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(2) INFORMATION FOR SEQ ID NO: 27: 

li» SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO:27: 
«C»i» ^OCCCCT «««« CCTTOOTCCC CT=TCCO»C O^COOC, 
GATGTGTACC TTGTAAGCAG TAATAC 
(2) INFORMATION FOR SEQ ID NO: 28: 

ti\ SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
ATAAGCTTCG CCATGAAATG CACCTGGGTC ATTCTCTTCT TGGTATCAAC AGCTACAAGT 
GTCCACTCCC AGGTCCAACT AGTGCACCGG TTA 
(2) INFORMATION FOR SEQ ID NO: 29: 

<i> SEQUENCE CHARACTERISTICS: 
1 (A) LENGTH : 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
TAACCGGTGC ACTAGTTGGA CCTGGGAGTG GACACTTGTA GCTGTTGATA CCAAGAAGAG 
AATGACCCAG GTGCATTTCA TGGCGAAGCT TAT 
(2) INFORMATION FOR SEQ ID NO: 30: 

(ii SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CAACTAGTGC AGTCTGGGGC TGAGGTTAAG AAGCCTGGGG CTTCAGTGAA GGTGTCCTGC 
AAGGGTTCTG GCTACACCTT CACCAGC 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
TAACCGGTAC TCTAGACGTT GGCCAGGCGC CTGCCTCACC CAATGCATCC AGTAGCTGGT 
GAAGGTGTAG CCAGAACCCT TGCAGGAC 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
CGTCTAGAGT GGATCGGAGA GATTGATCCT TCTGAGAGTA ATACTAACTA CAATCAAAAA 

TTCAAGGGAC GCGTCA 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
TAACCGGTGT GCTAGCGGAA ATGTCTACAG TCAATGTGAC GCGTCCCTTG AATTTTTGAT 

TGTAGTTAGT ATTACT 
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(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 
CCGCTAGCAC AGCCTACATG GAGCTCAGCA GCCTGAGATC TGAGGACACT GCGGTCTACT 
ATTGTGCAAG AGGGGGTTAC GACGGATG 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
TCACCGGTGC GGTGACCAGG GTGCCTTGAC CCCAGTAGTC AATAGCATAG TCCCATCCGT 
CGTAACCCCC TCTTGCACAA TAGTAGAC 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CTGGTCACCG TCTCCTCAGC CTCCACCAAG GGCCCATCGG TCTTCCCCCT GGCACCCTCC 
TCCAAGAGCA CCTCTGGGGG CACAG 
(2) INFORMATION FOR SEQ ID NO:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
TCACCGGTTC GGGGAAGTAG TCCTTGACCA GGCAGCCCAG GGCCGCTGTG CCCCCAGAGG 
TGCTCTTGGA GGAGGGTGCC AGGGG 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID 
CTGGCCAACG 

(2) INFORMATION FOR SEQ ID NO: 39: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base parrs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
CACATTGACT GTAGACACTT CCGCTAGCAC AGCC 
(2) INFORMATION FOR SEQ ID NO: 40: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
CCGGAGGTGA TGTTGTGGTG ACTC 
(2) INFORMATION FOR SEQ ID NO:41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
TAAGCTTCCG CCATGGGATG GAGC 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 
GGTGACACTA GTGCCTTGAC CCCAG 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
TAAGCTTCCG CCATGAAGTT GCCT 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 
GGCGCCGCAT CAGCCCGTTT T 
(2) INFORMATION FOR SEQ ID NO: 45: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

20 

CGGCGCCATC TGTCTTCATC 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

18 

AAGCTTCTAA CACTCTCC 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Asp Val Val Val Thr Gin Thr Pro Leu Ser Leu Pro Val Ser Phe Asp 
! 5 10 1 

Gly Gin Val 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Asp Val Val Val Thr Gin Thr Pro L u Ser Leu 
1 5 10 
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(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Asp Tyr Ala He Asp Tyr Trp Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Asp Val Val Met Thr Gin Thr Pro Leu Ser Leu Pro Val Ser Leu Gly 
1 5 10 15 

Asp Gin Ala Ser lie Ser Cys Arg Ser Ser Gin Ser Leu Val His Ser 
20 25 30 

Asn Gly Asn Thr Tyr Leu Glu Trp Tyr Leu Gin Lys Pro Gly Gin Ser 
35 40 45 

Pro Lys Leu Leu lie Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro 
50 55 60 

Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Lys lie 
65 70 75 

Ser Arg Val Glu Ala Glu Asp Leu Gly Val Tyr Tyr Cys Phe Gin Gly 
85 90 95 

Thr His Val Pro Pro Tyr Thr Phe Gly Gly Gly Thr Lys Leu Glu lie 
100 105 H° 

Lys 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
Asp lie Val Met Thr Gin Ser Pro Leu Ser Leu Pro Val Thr Pro Gly 
1 5 

Glu Pro Ala Ser He Ser Cys Arg Ser Ser Gin Ser Leu Leu Hie Ser 

20 25 
Xaa Asp Gly Asn Asn Tyr Leu Asn Tr P Tyr Leu Gin Lys Pro Gly Gin 



35 40 



Pro Gin Leu Leu lie Tyr Leu Val Ser Asn Arg Ala Ser Gly Val 



Ser Pro Gin Leu beu hb ±y* — 6Q 

50 55 
Pro Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Lys 
65 70 75 

lie Ser Arg Val Glu Ala Glu Asp Val Gly Val Tyr Tyr Cys Met Gin 

85 90 

Ala Leu 



Gin Xaa Pro Arg Xaa Thr Phe Gly Gin Gly Thr Lys Val Glu 



100 



He Lys 

) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Asp Val Val Met Thr Gin Ser Pro Leu Ser Leu Pro Val Thr Pro Gly 
1 5 

Glu Pro Ala Ser lie Ser Cys Arg Ser Ser Gin Ser Leu Ala Lys Ser 

20 25 
Tyr Gly Asn Thr Tyr Leu Ser Trp Tyr Leu Gin Lys Pro Gly Gin Ser 

35 40 
Pro Gin Leu Leu lie Tyr Gly He Ser Asn Arg Phe Ser Gly Val Pro 

50 55 
Asp Arg Phe Ser Gly Ser Gly Ser Gly. Thr Asp Phe Thr Leu Lys lie 
65 70 75 

Ser Arg Val Glu Ala Glu Asp Val Gly Val Tyr Tyr Cys Leu Gin Gly 

85 9U 
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Thr His Gin Pro Tyr Thr Phe Gly Gin Gly Thr Lys Val Glu lie Lya 

100 105 



INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Gin Val Gin Leu Gin Gin Pro Gly Ala Glu Leu Val Lys Pro Gly Ala 
1 5 10 

Ser Val Lys Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Ser Tyr 
20 25 

Trp Met His Trp Val Lys Gin Arg Pro Gly Gin Gly Leu Glu Trp He 

— 40 



35 



Gly Arg He Asp Pro Asn Ser Gly Gly Thr Asn Tyr Asn Glu Lys Phe 

50 55 
Lys Ser Lys Ala Thr Leu Thr Val Asp Lys Ser Ser Ser Thr Ala Tyr 

2* Gin Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cys 

85 yu 

Ala Arg Tyr Tyr Tyr Gly Gly Ser Ser Xaa Xaa Val Tyr Xaa Tyr Trp 

100 1US 
Tyr Phe Asp Tyr Trp Gly Gin Gly Thr Thr Val Thr Val Ser Ser 

2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Gin Val Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Ala 
1 5 10 

Ser Val Lys Val Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Ser Tyr 
20 25 
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»U He S f r Trp V.! «r, Oln JU Pro ,1, .1. Oly J- «« «P »* 
oly Trp lie Ann Pro Tyr Oly A.n oly MP Thr »™ »* «" WS 

Phe Tin Sly X. val Thr He Thr Ala MP Thr ser Thr ser Thr JJ. 

70 

" r Met Olu Leu ser ser Leu Mo ser Olu MP Thr Ala v.l Tyr Tyr 
eye ,U Arc Ala Pro Gly Tyr 01, ser Oly oly >ly eye Tyr «, .1, 
„p T»r X.a Phe Aep Tyr Trp Oly Oln Oly Thr Leu Val Thr Val Ser 



100 

Asp Tyr Trp 
115 120 

Ser 

2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
Oln Val Gin Leu Val Gin Ser Gly ^ «■» Val Lys Lys Pro oly Ala 



„ t„c rw ser Glv Tyr Thr Phe Thr Ser Tyr 
Ser Val Lys Val Ser Cys Lys Gly Ser Giy iyr 3Q 

20 " 
Trp Met Ri. Trp Val Arg oln Ala Pro oly oln Arc, Leu «1« Trp lie 

Oly Olu lie A.p Pro ser olu ser Aen Thr As„ Tyr Asn oln Ly. Phe 

50 55 
ly. Oly Aro val Thr Leu Thr val Asp He ser Ala ser Thr Ma Tyr 

l Olu Leu S.r s.r Leu Ar, Ser Olu » Thr Ala Val Tyr Tyr Cya 

85 

A la Arg Gly Gly Tyr Asp Gly Trp Asp Tyr Ala lie Asp Tyr Trp Gly 



100 i° 5 

Gin Gly Thr Leu Val Thr Val Ser Ser 
115 12U 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ix) FEATURE: . 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 30 w aeo - \ 
(D) OTHER INFORMATION: /mod_base- i 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
CCCAAGCTTC CAGGGRCCAR KGGATARACN GRTGG 
(2) INFORMATION FOR SEQ ID NO: 57: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

32 

CCCAAGCTTA CGAGGGGGAA GACATTTGGG AA 
(2) INFORMATION FOR SEQ ID NO:58: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
GGGAATTCAT GRAATGSASC TGGGTYWTYC TCTT 
(2) INFORMATION FOR SEQ ID NO: 59: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



34 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
ACTAGTCGAC ATGAAGWTGT GGBTRAACTG GRT 



33 
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( 2 )\ INFORMATION FOR SEQ ID NO: 60: 

^(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
CCCAAGCTTA CT^GATGGTG GGAAGATGGA 
(2) INFORMATION\^FOR SEQ ID NO: 61: 

(i) SEQUENck CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE\ nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
ACTAGTCGAC ATGGATTTWC ARGTGpAGAT TWTCAGCTT 
(2) INFORMATION FOR SEQ ID 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base j^airs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear \ 



(xi) SEQUENCE DESCRIPTION: SEQ ID WO: 62 



GGAAGCTTCC ACCATGGATT TCGGACTGGC CC 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



\ 



\ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 
GGACTAGTGG TTTGGACGAG CCTGTTG 



